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BACKGROUND OF THE INVENTION 



1. Field of the Invention 

5 

This invention is related to the field of computer networks and, more particularly, 
to maintaining the high availability in a two node computer network which utilizes 
alternate pathing technology. 

10 2. Description of the Related Art 

With the ever expanding use of computer networks throughout society has come 
an increasing dependence of users on the availability of that network. If a network goes 
down, or is otherwise unavailable, costs to an enterprise may be significant. 

1 5 Consequently, a number of techniques have arisen which are designed to ensure that a 
computer network is sufficiently robust that it may detect and respond to problems 
without significantly impacting users. Frequently, efforts to ensure a computer network is 
consistently online for its users may be referred to as maintaining "high availability". A 
computer network which has in place mechanisms which prevent hardware or software 

20 problems from impacting its users may be referred to as a High Availability Network 
(HAnet). Some of the characteristics which may be considered when defining a HAnet 
include protection of data (Reliability), continuous access to data (Availability), and 
techniques for correcting problems which minimally impact users (Serviceability). 
Collectively these characteristics are frequently referred to as RAS. 

25 

In some cases it is desirable to create a computer network which includes a two 
node Local Area Network (LAN). For example, it may be desirable to have a two node 
LAN consisting of a database server and its corresponding application server. These 
servers may be connected to each other using a well known method using crossover 

l 



cables. A crossover cable is a cable that is used to connect two computers by reversing 
their respective pin contacts. Using crossover cables may have the advantage of being 
highly secure, performing well, and eliminating several components typically present in a 
computer network, such as switches and routers, which could cause a failure. However, 
5 while such a configuration may improve reliability and availability in the system, it does 
not address serviceability and still contains single points of failure. For example, failure 
of either of the server network interfaces to which the crossover cable is connected will 
cause the network to be unavailable. Also, failure of the crossover cable itself would 
result in unavailability of the network. In some cases, mechanisms may be put in place 

1 0 which detect an error in a network connection and notify the system administrator that a 
problem exists. The system administrator may then take corrective action, such as 
switching to a redundant resource. However, such mechanisms typically take some period 
of time and necessarily involve interruptions in network operation. In other cases, 
operating system specific mechanisms may be implemented which may facilitate a 

1 5 failover to a redundant connection. Typically these mechanisms operate at layers below 
the application layer of the protocol stack. Two widely recognized protocols include 
TCP/IP and ISO/OSI, each of which include a highest layer referred to as the application 
layer. Other communication protocols with a layer corresponding to the application layer 
may utilize a different name. Generally, those layers below the application layer involve 

20 software and mechanisms which are not portable across different operating systems. 
Consequently, these solutions are not portable and generally require a newly created 
mechanism for each platform on which a failover is desired. 

One technology which provides for redundancy in case of failure is alternate 
25 pathing. Alternate pathing is a technology which provides for redundancy to storage in 
case of a failed I/O controller. In addition to providing for recovery after failure, alternate 
pathing may also be used to support dynamic reconfiguration. Dynamic reconfiguration is 
used to logically attach and detach system boards from a running operating system. In 
addition to providing redundancy to storage, alternate pathing may also be used with 
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network connections. However, alternate pathing does not support automatic failover for 
network connections. Consequently, the problems described above still remain. 

SUMMARY OF THE INVENTION 

5 

The problems outlined above are in large part solved by a method and mechanism 
as described herein. A method and mechanism of failover in a system with alternate 
pathing is described. By utilizing an Application layer mechanism which monitors the 
primary network connection, automatically detects a failure in the primary connection, 

10 and switches to the secondary connection in a short period of time, network availability 
may be maintained. Advantageously, network interruptions may be minimized and 
servicing of network problems may be automated by a mechanism which is portable 
across multiple platforms. Further, because the mechanism operates within the 
application layer of the communication protocol, no modification of existing operating 

1 5 software is necessary. 

Broadly speaking, a method for maintaining high availability in a two node 
computer network utilizing alternate pathing is contemplated. The method includes 
adding an Application layer High Availability Networking (HAnet) mechanism to a node 
20 of the computer network, monitoring a first network connection, detecting a failure of the 
first network connection, and performing a failover from the first network connection to 
the second network connection. The monitoring, failure detection, and failover are all 
performed by the HAnet mechanism. 

25 Also contemplated is a network node configured to support alternate pathing 

which includes a first network interface, a second network interface, and a High 
Availability Networking (HAnet) mechanism. The included HAnet mechanism operates 
at the Application layer and is configured to monitor the first network interface. If a 
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failure of the first network interface is detected, the HAnet mechanism is configured to 
perform a failover from the first network interface to the second network interface. 

Further contemplated is a two node computer network configured to support 
alternate pathing and to maintain high availability. The network includes a first node 
coupled to a second node by two paths. The first node includes a High Availability 
Networking (HAnet) mechanism which operates at the Application layer. The HAnet 
mechanism is configured to monitor the first path and perform a failover from the first 
path to the alternate path in response to detecting a failure of the first path. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Other objects and advantages of the invention will become apparent upon reading 
the following detailed description and upon reference to the accompanying drawings in 
which: 

Fig. 1 is a diagram of a computer network. 

Fig. 2 is a diagram of a two node local area network. 

Fig. 3 is a diagram of a two node network with a redundant connection. 

Fig. 4 is a diagram of a two node network with a failover mechanism. 

Fig. 5 is a diagram of a flowchart illustrating a failover method. 

Fig. 6 is a diagram of a computer network with a firewall. 

Fig. 7 is a diagram of a computer network with a firewall and failover mechanism. 



Fig, 8 is a diagram of a computer network with trunking. 

Fig. 9 is a diagram of a computer network with trunking and a failover 
5 mechanism. 

Fig. 10 is a diagram of a flowchart illustrating a failover method. 

Fig. 1 1 is a diagram of a computer network with alternate pathing. 

10 

Fig. 12 is a diagram of a computer network with alternate pathing and a failover 
mechanism. 

Fig. 13 is a diagram of a computer network utilizing virtual IP addresses and a 
1 5 failover mechanism. 

While the invention is susceptible to various modifications and alternative forms, 
specific embodiments thereof are shown by way of example in the drawings and will 
herein be described in detail. It should be understood, however, that the drawings and 
20 detailed description thereto are not intended to limit the invention to the particular form 
disclosed, but on the contrary, the intention is to cover all modifications, equivalents and 
alternatives falling within the spirit and scope of the present invention as defined by the 
appended claims. 

25 DETAILED DESCRIPTION OF THE INVENTION 

Computer Network Overview 
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Fig. 1 is an illustration of a computer network 100. Included in computer network 
100 are database server 104, application server 102A, application server 102B, network 
infrastructure 108, and workstation 110. Database server 104 is coupled to application 
servers 102A and 102B. Application server 102A and 102B are further coupled to 
5 network infrastructure 108. Workstation 108 is also coupled to network infrastructure 
108. In addition, database server 104 includes network interfaces 120B and 120C, 
application server 102A includes network interfaces 120A and 120F, and application 
server 102B includes network interfaces 120D and 120E. Elements referred to herein with 
a particular reference number followed by a letter will be collectively referred to by the 
10 reference number alone. For example, interfaces 120A-120F will be collectively referred 
to as interfaces 120. In one embodiment, interfaces 120 are network interface cards 
(NIC). 



In computer network 100 of fig. 1, users may access the network 100 via 
1 5 workstations, such as workstation 110, coupled to network infrastructure 1 08. In one 
embodiment, a user on a workstation 1 10 may interact directly with a "front-end" 
interface, which in turn communicates with an application on application server 102 A. 
The application on application server 102A may then access required data from data base 
server 104. Application server 102 A may be viewed as in intermediary between the front- 
20 end workstation 110 and the back-end database 104. By acting as an intermediary, 

application server 102 A prevents users from directly accessing database server 104 and 
may better balance the load on database server 104. While there are advantages to 
utilizing the two node structure of application server 102A and database server 104, 
having a single connection between the two includes the existence of "single points of 
25 failure" as discussed in fig. 2. A single point of failure may be described as a point in the 
network, for example a device or connection, which has no backup and whose failure will 
cause an interruption in network service. 
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Fig. 2 is an illustration of a two node network. Included in fig. 2 are database 
server 202 and application server 204. Database server 202 includes NIC 220B. 
Application server 204 includes NIC 220A and 220C. In addition, application server 204 
includes Chinese font application 260. NIC 220A is coupled to NIC 220B via 
5 interconnect 240. In one embodiment, interconnect 240 is a crossover cable. In the 
example of fig. 2, a user may access application server 204 via interconnect 250. A user 
may require access to application server 204 for the purpose of utilizing Chinese font 
application 260. In communicating with a user, application server 204 may access 
database server 202. 

10 

As shown in fig. 2 5 there exists a single interconnect 240 between application 
server 204 and database server 202. If that single interconnect 240 fails, communication 
between application server 204 and database server 202 will be lost. Likewise, if either 
interface NIC 220A or NIC 220B fails, communication between application server 204 

1 5 and database server 202 will be lost. Consequently, each of these components 240, 220 A 
and 220B represents a single point of failure as described above. Further, because there is 
no backup connection between server 202 and 204, if this single connection 230 fails the 
only solution is to replace or repair the defective component. In any event, the failure of 
this single connection 230 will result in interruption of network service. In order to 

20 address this problem, a redundant connection may be introduced as discussed in the 
following. 



Fig. 3 is an illustration of a two node network which includes a redundant 
connection. Included in the network are a primary connection 330A and a secondary 
25 connection 330B. Primary connection 330A includes NIC 320A, NIC 320B, and 
interconnect 340. Secondary connection 330B includes NIC 322A, NIC 322B, and 
interconnect 342. In one embodiment, primary connection 330A serves to transmit 
communications between database server 202 and application server 204. If a failure of 
connection 330A occurs, connection 330B may be utilized to communicate between 
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servers 202 and 204. By incorporating secondary connection 330B, the single point of 
failure that exists in a single connection network as described above is eliminated. 
However, in order to avoid interruptions in network service, secondary connection 330B 
must be able to assume the functions of primary connection 330A in a very short period 
5 of time. If a failure of primary connection 330A occurs and secondary connection 330B 
does not assume the functions of primary connection 330A very rapidly, users will 
experience network service interruption. In the following discussion, a failover method is 
described which provides for high availability and serviceability. 

10 Network Protocol and Addressing Overview 

A well known communications protocol for networking is the Transmission 
Control Protocol/Internet Protocol (TCP/IP) communications protocol. In general, each 
host connected to a network has an assigned logical address, or Internet Protocol (IP) 

15 address. When one host wishes to convey data to another host, packets are formed which 
contain the data and the IP address of the destination host and the packets are then 
delivered to the desired destination. Necessarily, each host must have a unique IP address. 
Each IP address is a sequence of digits comprised of two basic parts, a network ID and a 
host ID. The network ID identifies devices which are located on the same physical 

20 network. The host ID is used to identify a particular host within a network. A second 

address which may be associated with a host is a subnet mask. A subnet mask is used to 
divide a larger network into smaller sub-networks. Frequently, network administration 
may be simplified by carefully defining sub-networks for a network. A subnet mask 
includes a sequence of digits similar to an IP address and is used to indicate which part of 

25 an IP address corresponds to a network ID and which part corresponds to host ID. A third 
address used in the IP protocol is a broadcast address. The broadcast address may be used 
by a host to convey data to all other hosts on a network simultaneously, as opposed to 
sending data to a targeted host with its unique IP address. Finally, each host interface 
includes a physical, or Media Access Control (MAC), address. The MAC address is 
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typically configured at the time of manufacture of the interface. In one embodiment, 
interface MAC addresses may be specified by the host of which they are a part. In such a 
case, a host with multiple interfaces may specify the same MAC address for all interfaces. 
In an alternative embodiment, each interface on a given host may have a different MAC 
address. Because packet delivery is handled at the physical layer and IP addresses are 
only logical addresses, IP addresses must be converted to corresponding MAC addresses. 
IP uses the well known Address Resolution Protocol (ARP) to build a table in memory 
which maps IP addresses to corresponding MAC addresses. By referencing this table, 
physical addresses corresponding to IP addresses may be determined. 

In the UNIX operating system, network interface configuration may be done using 
the well known "ifconfig" command. Among other things, the ifconfig command allows 
the configuration of an interface's IP address, subnet mask, and broadcast address. The 
basic syntax for ifconfig is typically as follows: 

ifconfig [interface] [command] [address] [options] 

where "interface" is the name of the interface, "command" includes a variety of command 
which depends on the particular version of UNIX being run, "address" is either a host 
name or address expressed in typical dot notation, and "options" may be any of a number 
of configuration options. Some of the options which may be specified are "up", "down", 
"netmask", "broadcast", and "plumb". The plumb option is used to setup and open the 
streams necessary for TCP/IP to use the interface. The broadcast option is used to 
configure a broadcast address. The netmask option is used to establish the subnet mask 
for the interface. The down option prevents the system from communicating through the 
interface. Finally, the up option enables the interface for communication. For example, 
the following command may be used to assign the IP address 172.16.1.2 to the interface 
named leO: 



ifconfig leO inet 172.16.1.2. 

Another well known utility in UNIX is the "ping" command. The ping command 
uses timed IP/ICMP ECHOJRJEQUEST and ECHO_REPLY packets to probe the 
5 "distance" to a target machine. By examining packets received from the target machine, 
communication latency may be determined. In one embodiment, the ping utility may be 
used to determine if a particular network connection is operational. This use is sometimes 
referred to as a "heartbeat", for it indicates whether or not a particular node or connection 
is "alive". For example, one node on a network may use the ping command to 

1 0 periodically send requests to another node over a particular network interface. As long as 
a corresponding reply is received, it may be assumed that the connection is working. If no 
reply is received, the connection may no longer be operational and corrective action may 
be required. In addition to the ping utility, other similar types of request/reply 
communication utilities may be used to test network connections as well. As an 

15 alternative to using the ping utility, some NICs are configured to maintain a status of the 
network connection to which they are connected. Such a status may be indicated by a 
status register within the NIC itself. By polling this status register, such as through device 
driver calls, a node may detect whether or not a particular connection is operational. By 
utilizing a network interface configuration utility such as "ifconfig", and monitoring a 

20 network connection, a mechanism for maintaining high availability may be created as 
described by the following. 

Fig. 4 is a diagram of one embodiment of a computer network including host 402, 
host 404, primary connection 450 and secondary connection 452. Host 402 includes 
25 interfaces hmeO and hmel, NIC 420, NIC 422, and failover mechanism 410A. In one 
embodiment, failover mechanism 410 includes software running at the application layer 
of the communications protocol. Host 404 includes NIC 440, NIC 442 and failover 
mechanism 41 0B. Interface hmeO is a primary interface and hmel is a secondary 
interface. NIC 420 is coupled to NIC 440 via connection 430 and NIC 422 is coupled to 
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NIC 442 via connection 432. In one embodiment, NIC 420, NIC 422, NIC 440 and NIC 
442 are Ethernet adapters. Fast Ethernet, Gigabit Ethernet, or any other suitable network 
adapters may be utilized as well. In order to support the TCP/IP protocol, network 
adapters such as 420 and 422 may be configured using the ifconfig utility as described 
above. Primary interface hmeO is configured with an IP address of 129.1.1.1, a Netmask 
of 255.255.255.0, a Broadcast address of 129.1.1.255 and a MAC address of 08-A0-20- 
CD-10-53. Secondary interface hmel is "cold", or "unplumbed", and is not configured. 
The interface corresponding to NIC 440 is configured with an IP Address of 129.1.1.2, 
Netmask of 255.255.255.0, Broadcast address of 129.1.1.255, and MAC address of 08- 
A0-20-CD-10-54. The interface corresponding to NIC 442 is not plumbed. In the 
embodiment shown, the MAC addresses of NIC 420 and NIC 422 are determined by host 
402 and are the same. The MAC addresses of NIC 440 and 442 may determined by host 
404 and are the same. 

In one embodiment, failover mechanism 4 1 OA is configured to monitor primary 
connection 450 by utilizing a heartbeat 460. Failover mechanism 410A monitors this 
heartbeat 460 by periodically "pinging" host 404 by sending packets over NIC 420 to 
host 404 via connection 430. Host 404 replies to these packets received from host 402 by 
sending reply packets over NIC 440 to host 402 via connection 430. If host 402 receives a 
reply from host 404, failover mechanism 41 OA is configured to detect primary connection 
450 is operational. However, if failover mechanism 41 OA detects a failure in heartbeat 
460 (e.g., no reply packets received from host 404), failover mechanism 41 OA is 
configured to perform a failover to secondary connection 452. In a similar manner, 
failover mechanism 41 0B may be configured to monitor primary connection 450 by 
monitoring heartbeat 460. Failover mechanism may be configured to detect a failure in 
primary connection 450 when no packets are received from host 402 via connection 450. 
In order to perform failover from connection 450 to connection 452, failover mechanism 
41 OA may take down NIC 420, stop monitoring primary connection 450 and utilize the 
ifconfig command to plumb NIC 422 and configure NIC 422 with the parameters 



corresponding to NIC 420. Likewise, failover mechanism 41 OB may detect a failure in 
primary connection 450 and perform a failover to secondary connection 452. Failover 
mechanism 41 OA may include the following commands to perform the failover: 

pausemonitor 
ifconfig hmeO unplumb 
ifconfig hmel plumb 

ifconfig hmel inet 129.1.1.1 netmask 255.255.255.0 broadcast 129.1.1.255 up 
resume_monitor 

Upon detecting a failure, the failover routine is called and monitoring of the primary 
connection 450 is paused. The first ifconfig command above "unplumbs" interface hmeO 
so that host 402 will not attempt to use it for communications. The second ifconfig 
command above "plumbs" interface hmel and sets up the TCP/IP streams needed for host 
402 to use this interface. The third ifconfig command above configures interface hmel 
with the parameters corresponding to hmeO. Interface hmel is configured with the IP 
address 129.1.1.1, subnet mask 255.255.255.0, and the broadcast address 129.1.1.255. 
The "up" parameter brings interface hmel up so that it may be used by host 402 for 
communications. Upon completion of the failover to the new interface, monitoring of the 
network connection is resumed. Because NIC 420 and NIC 422 have the same MAC 
address, there is no need to transfer the MAC address of NIC 420 to NIC 422. Failover 
mechanism 41 0B may utilize a similar set of commands to perform failover to secondary 
connection 452. Advantageously, the connection failure is transparent to clients of the 
node and the network connection is maintained. Further, because failover mechanism 410 
operates at the application layer, it is not necessary to modify the operating system, 
drivers, or network software of host 402 or host 404. Advantageously, failover 
mechanism 410 is portable and may be easily implemented in a wide variety of systems 
running a variety of operating systems. In one embodiment, upon completing a failover to 
secondary connection 452, monitoring of secondary connection 452 begins and a failover 
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to primary connection 450 may be initiated in the event secondary connection 452 fails. 
Further, in one embodiment it may be necessary to "unplumb" primary connection 450 
prior to plumbing secondary connection 452. 

In one embodiment, failover mechanism 410 may include a configuration file 
specifying a delay parameter. The delay parameter may indicate a specified period of 
time a host is to wait before checking network connectivity. In this manner, both hosts 
402 and 404 may have time to perform failover before one begins communicating again. 
Were one host to begin communicating prior to completion of failover by both, the 
network connection may appear inoperable to the transmitting host and another failover 
may be initiated. 

Fig. 5 is a flowchart illustrating a failover method. Initially, a secondary network 
connection is added to a two node LAN (block 502). This newly added connection 
includes an additional NIC added to each node and a crossover cable coupling the new 
NICs together. The primary network connection is then monitored for failure (block 504). 
Such monitoring may include using a ping utility or polling a status register of the 
corresponding NIC. If no failure of the primary connection is detected (decision block 
506), monitoring of the primary connection continues (block 504). If a failure of the 
primary connection is detected (decision block 506), monitoring of the primary 
connection is stopped (block 508), the primary connection is brought down (block 510) 
and the secondary connection is plumbed (block 512). Finally, the secondary connection 
is configured with the parameters of the primary connection and is brought up (block 
514), and monitoring of the newly enabled connection is started (block 516). 

Firewalls 

A firewall is a well known technology which provides for restricted 
communications between separate networks. Firewalls are frequently used by an 



enterprise to control access of those on an outside network, such as the Internet, to the 
enterprise's inner network. Firewalls may also be used to protect some parts of an inner 
network from other parts of an inner network. Firewalls typically use a complex set of 
predefined rules which are applied to communications between networks. Firewalls 
5 generally attempt to match an incoming or outgoing communication and service with one 
of the predefined rules. If a rule match is found, the communication may be either 
accepted or rejected. 

Fig. 6 is a diagram illustrating a computer network with a firewall. Fig. 6 includes 
an external network 690, firewall 602, switch 604, and internal network 692. Firewall 602 
includes NIC 620 and NIC 680. Switch 604 includes port 640 and port 682. External 
network 690 is coupled to firewall 602 through NIC 680. Internal network 692 is coupled 
to switch 604 through NIC 682. NIC 620 is coupled to port 640 via interconnect 630. In 
general, firewall 602 controls communications between the external network 690 and the 
internal network 692. In a typical configuration, internal network 692 may be a LAN 
within a company and external network 690 may represent a network outside the 
company. By placing the firewall between the external network 690 and the internal 
network 692, communications may be controlled and security enhanced. As shown in fig. 
6, there is a single connection 650 between firewall 602 and switch 604. If connection 
650 were to fail, communication between internal network 692 and external network 690 
would be lost. Connection 650 represents a single point of failure. To eliminate this 
single point of failure and provide for high availability, a failover mechanism as 
discussed below may be utilized. 

25 Fig. 7 is a diagram illustrating a computer network with a firewall and an added 

redundant connection. Fig. 7 includes an external network 690, firewall 602, switch 604, 
switch 605 and internal network 692. Firewall 602 includes failover mechanism 610, NIC 
620, NIC 622 and NIC 680. Switch 604 includes port 640 and port 682. Switch 605 
includes ports 642 and 684. External network 690 is coupled to firewall 602 through NIC 
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680. Internal network 692 is coupled to switch 604 through port 682 and switch 605 
through port 684. NIC 620 is coupled to port 640 via interconnect 630. NIC 622 is 
coupled to port 642 via interconnect 632. Switches 604 and 605 are coupled to each 
other. Connection 650 represents a primary connection between firewall 602 and switch 
5 604. Connection 652 represents a secondary connection between firewall 602 and switch 
604. 



Because of the security oriented nature of firewalls, certain operations which may 
be permitted on a non-firewall server may not be permitted on a firewall server. For 
10 example, in certain firewall implementations, all interfaces must be defined when the 
system is started and all interfaces must be kept defined at all times in order to prevent 
security breaches. In such an implementation, plumbing an interface after boot may be 
seen as a hostile act by a firewall server and may cause the firewall server to shut down. 
;£i Consequently, if the interface corresponding to NIC 622 is plumbed after firewall 602 has 

CO 1 5 started, firewall 602 may shut down all communications between external network 690 

| i; y and internal network 692. Further, if interfaces corresponding to both NIC 620 and 622 

^ are plumbed at startup, they may both have the same MAC address which may not be 

□ permissible. 

20 In order to implement a failover mechanism for a system such as firewall 602, the 

^ j following method may be used. A redundant connection 652 is added as described above. 

On startup of firewall 602, failover mechanism 610 defines all interfaces. Failover 
mechanism 610 plumbs and configures NIC 620, NIC 622, and NIC 680. NIC 620 and 
NIC 680 are plumbed and configured with the actual parameters to be used in network 

25 communication. However, NIC 622 is plumbed and configured with dummy IP addresses 
and a dummy MAC address. Upon configuring NIC 622 with the dummy parameters, 
NIC 622 is immediately brought down by failover mechanism 610. In this manner, all 
interfaces may be defined at startup, but there is no conflict between IP addresses or 
MAC addresses. In this case, the secondary connection may be deemed a "warm" standby 



as it is already plumbed and configured, but not "up" or active. As shown in fig. 7, the 
following interface parameters are indicated: 



Interface 


NIC 620 


NIC 622 


Parameters 


IP Address: 129.1.1.1 
Netmask : 255.255.255.0 
Broadcast: 129.1.1.255 
MAC : 08-A0-20-CD-10-53 


IP Address: 129.1.1.5 
Netmask : 255.255.255.0 
Broadcast : 129.1.1.255 
MAC : 08-A0-20-CD- 10-55 



5 In order to perform the failover from primary connection 650 to secondary 

connection 652, a method similar to that discussed in fig. 4 may be used. Utilizing a 
heartbeat or similar mechanism, a failure of the primary connection 650 may be detected 
by failover mechanism 610. Assuming failover mechanism 610 initially detects the 
failure of primary connection 650, failover mechanism 610 stops monitoring primary 

10 connection 650. NIC 620 is then brought down. NIC 622 is then configured using 
ifconfig with the parameters which are currently assigned to NIC 620 and NIC 620 is 
configured with the dummy parameters which were associated with NIC 622. NIC 622 is 
then brought up. Failover mechanism 610 may then monitor connection 652 as the new 
primary connection and have connection 650 serve as the new secondary connection. In 

15 the event connection 652 fails, a failover to connection 650 may be done. In alternate 
embodiments, firewall 602 may be coupled to another host which has a failover 
mechanism. In such a case, as discussed above, a configuration file may be utilized which 
includes a delay parameter to ensure both firewall 602 and the coupled host have 
completed failover before resuming communications. Advantageously, network 

20 configurations which include security mechanisms such as firewalls may include the 
above described failover mechanism and high availability may be attained. 

Increasing Bandwidth 
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Because of the ever increasing demands placed on networks today, ways of 
increasing network bandwidth are of frequent concern. While Fast Ethernet and Gigabit 
Ethernet may serve to improve performance, the use of such technologies necessitate the 
need for an even greater increase in backbone capacity. One well known technique used 
5 to increase bandwidth is called "trunking". Trunking is a technology which may provide 
dramatic increases in network performance. Using trunking technology, multiple ports 
may be combined into a single logical port creating a single, high speed, logical link. Fig. 
8 is an illustration of a network utilizing trunking to boost throughput and reduce network 
latency. Included in fig. 8 are host 802, switch 820, LAN 1 830A and LAN 2 830B. Host 
10 802 includes a four port NIC 880. NIC 880 is coupled to switch 820 via links 810A- 

810D. In one embodiment, each of links 810A-810D represent Fast Ethernet connections. 
Switch 820 is further coupled to LAN 1 830A and LAN 2 830B. 

While NIC 880 has four ports, all four ports may be configured to act as a single 
logical port. Consequently, switch 820 may utilize any of the four ports to convey data to 
host 802. In one embodiment, connections 810 may be utilized in a round robin order. In 
this manner, links 810 may be viewed as a single connection with four times the capacity 
of any individual connection 810A-810D. Because all four ports of NIC 880 may be 
viewed as a single logical port, data communicated from LAN 1 830A or LAN 2 830B to 
host 802 has four possible connections upon which to be conveyed. In addition to 
increasing the bandwidth between Host 802 and Switch 820, the availability of additional 
connections provides for redundancy as well. If one of connections 810 were to fail, the 
failed connection may be disabled and the remaining connections may continue service. 
In this manner, availability of the connection between Host 802 and LANs 830 may be 
maintained. 

While trunking provides for increased bandwidth and redundancy, there still 
exists single points of failure. Because all four ports of NIC 880 must be connected to the 
same switch 820, a single point of failure is introduced. For example, if Switch 820 fails, 
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communication between Host 802 and LANs 830 will be lost. Further, if NIC 880 were to 
fail, communication between Host 802 and LANs 830 would be lost. In order to eliminate 
single points of failure and maintain high availability, a failover method as described in 
the following may be introduced. Fig. 9 is a diagram of a network which includes 
5 trunking and a failover mechanism. Included in fig. 9 are host 802, switches 820, 920, 
930A, 930B, and LANs 830A and 830B. Host 802 includes NIC 880 coupled to switch 
820, and NIC 980 coupled to switch 920. Switch 820 and Switch 920 are further coupled 
to switches 930. LANs 830 are each coupled to both switch 930A and 930B. Host 802 
also includes failover mechanism 990. NIC 880, connection 810, and Switch 820 
10 represent a primary connection 982. NIC 980, connection 910, and Switch 920 represent 
a secondary connection 984. 

In ordinary operation, primary connection 982 handles all network traffic between 
host 802 and LANs 830. In one embodiment, NIC 980 is initially unplumbed and 
unconfigured. NIC 880 may be configured as described above using the ifconfig 
command. Failover mechanism 990 may be configured to monitor primary connection 
810 using a heartbeat, status register polling or other similar technique. In one 
embodiment, failover mechanism 990 is configured to monitor the operational status of 
each of the four ports 810A-810D of NIC 880. If failover mechanism 990 detects a failure 
of one of connections 810A-810D, a failover to secondary connection 984 may be 
commenced. Primary NIC 880 trO is configured with an IP address of 129.1. LI, a 
Netmask of 255.255.255.0 and a Broadcast address of 129.1.1.255. Secondary NIC 980 is 
unplumbed. Both NIC 880 and NIC 980 have a MAC address of 08-A0-20-CD-10-53. 
Assuming NIC 880 has the designation trO and NIC 980 has the designation trl, the 
following commands may be used to perform the failover. 

trconfig stop trO 
trconfig define trl 

tr_config trl 129.1.1.1 netmask 255.255.255.0 broadcast 129.1.1.255 up 
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tr_config start_trunk_monitor 

Upon detecting a failure, tr_config stop trO stops failover mechanism 990 from 
monitoring primary connection 982 and unplumbs trO. The second tr_config command 
5 above "plumbs" interface trl and sets up the TCP/IP streams needed for host 802 to use 
this interface. The third tr_config command above configures interface trl with the 
parameters corresponding to trO, Interface trl is configured with the IP address 129.1.1.1, 
subnet mask 255.255.255.0, and the broadcast address 129.1.1.255. The "up" parameter 
brings interface trl up so that it may be used by host 802 for communications. Finally, 

10 the failover switch procedure returns and failover mechanism 990 begins monitoring 
secondary connection 984. Because NIC 880 and NIC 980 have the same MAC address, 
there is no need to transfer the MAC address of NIC 880 to NIC 980. Advantageously, 
the connection failure is transparent to clients of the node and the network connection is 
maintained. Further, because failover mechanism 990 operates at the application layer, it 

15 is not necessary to modify the operating system, drivers, or network software of host 802. 
Advantageously, failover mechanism 990 is portable and may be easily implemented in a 
wide variety of systems running a variety of operating systems. In one embodiment, upon 
completing a failover to secondary connection 984, monitoring of secondary connection 
984 begins and a failover to primary connection 982 may be initiated in the event 

20 secondary connection 984 fails. 

In an alternative embodiment, failover mechanism 990 may be configured to 
failover only when the performance level of a connection falls below a predetermined 
threshold. For example, while monitoring primary connection 982, failover mechanism 
25 990 may detect a failure of connection 81 OA only. Trunking capability may then disable 
connection 81 OA and utilize the remaining three connections, 810B-810D, to continue 
communications. If the level of performance of primary connection 982 still exceeds a 
particular threshold, failover mechanism 990 may not perform a failover. If then failover 
mechanism 990 detects a failure of a second connection, 810B, and the performance of 
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the primary connection 982 falls below the threshold, a failover to secondary connection 
984 may be performed. Advantageously, failover may be performed only when deemed 
necessary. In one embodiment, failover mechanism 990 may be configured to read a 
configuration file to determine the failover threshold. For example, a threshold parameter 
5 may be set to 50% of connection capacity. Failover mechanism 990 may then perform a 
failover if 50% or more of the connections in a trunked interface fail. 

Fig, 10 is a flowchart illustrating a method for failover in a network utilizing 
trunking. Initially, a secondary network connection is added to a host (block 1002). This 

10 newly added connection includes an additional multi-ported NIC coupled to a trunking 
enabled switch. The primary network connection is then monitored for failure (block 
1004). Such monitoring may include using a ping utility or polling a status register of the 
corresponding NIC. If no failure of the primary connection is detected (decision block 
1006), monitoring of the primary connection continues (block 1004). If a failure of one or 

15 more of the trunking connections in the primary connection is detected (decision block 
1006), a determination is made as to whether a performance related threshold parameter 
has been exceeded (decision block 1007). Such a determination may include reading a 
threshold parameter from a configuration file as described above. If the threshold 
parameter has not been exceeded, flow returns to block 1004, If the threshold parameter 

20 has been exceeded (decision block 1 007), monitoring of the primary connection is paused 
(block 1008), the primary connection is brought down (block 1010) and the secondary 
connection is plumbed (block 1012). Finally, the secondary connection is configured with 
the parameters of the primary connection and brought up (block 1014), and monitoring of 
the newly enabled connection is started (block 1016). 

25 

In addition to the above described four port (quad) trunking technology, the 
described failover mechanism may be applied to other trunking technologies as well. For 
example, two Gigabit Ethernet ports may be combined to form a trunk. In the event one 
of the two Gigabit Ethernet connections fails, a failover may be performed to a redundant 
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Gigabit Ethernet trunk. Further, the above description assumes a secondary connection is 
initially unplumbed. However, in the case of a host which utilizes a firewall technology, 
or other technology requiring all interfaces be defined at startup, a warm standby as 
described above may be utilized. 

Alternate Pathing 

Alternate pathing is a technology which provides for redundancy to storage in 
case of a failed I/O controller. In addition to providing for recovery after failure, alternate 
pathing may also be used to support dynamic reconfiguration. Dynamic reconfiguration is 
used to logically attach and detach system boards from a running operating system, 
without the need to reboot. Fig. 11 is an illustration of a system utilizing alternate pathing 
technology. Included in fig. 1 1 are host 1 102, I/O controllers 1 1 10A and 1 HOB, and disk 
array 1 140. I/O controller 1 1 10A is coupled to disk array 1 140 via interconnect 1 120A. 
I/O controller 1 1 10B is coupled to disk array 1 140 via interconnect 1 120B. During 
ordinary operation, I/O controller 1 1 10A is used for all I/O transactions between host 
1 102 and disk array 1 140. In the event I/O controller 1 1 10A fails, I/O controller 1 1 10B is 
utilized for transactions between host 1 102 and disk array 1 140. On the other hand, the 
dynamic reconfiguration feature of alternate pathing provides that if the system 
administrator desires to remove or replace I/O controller 1 1 10A without stopping and 
restarting host 1 1 02, I/O controller 1 1 1 0B may be enabled and I/O controller 1 1 1 OA 
removed without the need to reboot host 1 102. 

Alternate pathing may also be used with network interfaces. When used with 
network interfaces, alternate pathing utilizes "metanetworking" to support alternate paths. 
A metanetwork is a logical construct that enables you to access a network by using one of 
two physical paths without having to explicitly reference either path. You reference a 
metanetwork using a metanetwork interface name such as "metherl" as will be discussed 
below. However, while alternate pathing may provide for dynamic reconfiguration of 



network interface cards, automatic failover for networks after boot is not supported. If 
alternate pathing is used with network interfaces and a failure occurs, a manual failover is 
required. Because of the necessary delay involved in a manual failover, network 
operations will be interrupted. By adding a failover mechanism to a system with alternate 
5 pathing, automatic failover and high availability may be attained. 

Fig. 12 is a diagram illustrating a system with alternate pathing and a failover 
mechanism. Shown in fig. 12 is a host 1202 coupled to an Ethernet interconnect 1270. 
Host 1202 includes NIC 1220, NIC 1222 and failover mechanism 1210. NIC 1220 is 
10 coupled to Ethernet 1270 via connection 1230 and NIC 1222 is coupled to Ethernet 1270 
via connection 1232. Connection 1250 represents a primary path and connection 1252 
represents an alternate path. While NIC 1220 is named hmeO and NIC 1222 is named 
^ hmel, alternate pathing provides that host 1202 may refer to either path using the 

{ M metaname "metherl" Primary path 1250 is active and alternate path 1252 is inactive. 

Js 15 Consequently, all network communications to and from host 1202 occur on primary path 

i i: U 1250. Failover mechanism 1210 is configured to monitor primary path 1250 utilizing a 

h ^ heartbeat, polled status register, or other technique. When a failure of primary path 1250 

3 is detected, failover mechanism 1210 may then perform an automatic failover to alternate 

! ( s :; 

p path 1252. In one embodiment, failover mechanism 1210 may utilize the command 

,S 20 "apconfig -P <metaname> -a interface name>" to perform the failover, where 

,3513. 

W "metaname" is the metaname associated with the logical network connection and "hmel" 

is the name corresponding to the physical interface. For example, in the system of fig. 12, 
the command "apconfig -P metherO -a hmel" may be used to activate alternate path 
1252 and inactivate primary path 1250. Alternatively, the "ifconfig" command may be 
25 used to configure and bring up the alternate path as described above. Advantageously, 
high availability and serviceability may be attained in a network with alternate pathing. 
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Failover with Virtual IP Addresses 



Many times an enterprise may wish to have multiple servers, but does not have 
the budget to invest in additional hardware. A typical example of such a case is an 
5 Internet Service Provider (ISP) who supplies multiple Web servers, but does not wish to 
provide a separate computer system for every server. One well known method of enabling 
multiple servers on a single machine is through the use of virtual IP addresses. A virtual 
IP address is a logical address created by a system administrator which typically 
corresponds to a real interface address. Because the virtual IP address is a logical address, 

10 multiple virtual IP addresses may be created which correspond to the same interface. 

Consequently, multiple Web servers with unique virtual IP addresses may be hosted on a 
single physical computer. In addition to providing the ability to have multiple servers on 
a single computer, virtual IP addresses may be used to provide for physical server 
redundancy and load balancing. For example, if a failure of an interface on a primary 

1 5 server occurs, a failover to a backup server may be utilized to maintain high availability. 

Fig. 13 is an example of a computer system utilizing virtual IP addresses. 
Included in fig. 13 are hosts 1302A and 1302B. Host 1302A includes failover mechanism 
1304A, NIC 1310A, and NIC 1310B. Host 1302B includes failover mechanism 1304B, 

20 NIC 1312A, and NIC 1312B. NIC 1310A is coupled to Switch 1320A via interconnect 
1330. NIC 1310B is coupled to Switch 1320B via interconnect 1334. NIC 1312A is 
coupled to Switch 1320A via interconnect 1332. NIC 1312B is coupled to Switch 1320B 
via interconnect 1336. Switch 1320A is coupled to Switch 1320B via interconnect 1340. 
NIC 1310A and NIC 1312A are named hmeO. NIC 1310B and NIC 1312B are named 

25 hmel.NIC 1310A has an interface address of 129.126.1.1 and NIC 1312Ahasan 

interface address of 129.126.1.2. In one embodiment, NICs 1310B and 1312B serve as 
backup interfaces. Backup interfaces NIC 1310B and 1312B may be either cold or warm 
standby interfaces as described above. As cold interfaces, NIC 1310B and 1312B may 
have the same MAC address as primary interfaces NIC 1310A and 1312A, respectively. 
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If backup interfaces 1310B and 1312B are warm, unique MAC addresses and dummy 
parameters may be utilized as previously discussed. 

In order to create virtual IP addresses for interfaces 1310 and 1312, a command 
with the following syntax may be used: 

ifconfig IF:N ip-address up 

where EF represents the actual interface name, N is a number typically between 1 and 255, 
and ip-address is the desired virtual IP address. Using the above syntax, the following 
command may be used to create virtual interfaces for host 1302A of fig. 13: 

ifconfig hmeO:l 129,126.1.3 up 
ifconfig hme0:2 129.126.1.5 up 
ifconfig hme0:3 129.126.1.7 up 

Each of the three commands above defines and brings up a virtual address which 
corresponds to interface hmeO of host 1302A. When any of the above virtual addresses is 
specified, NIC 1310A recognizes the specified address as its own. Likewise the following 
commands may be used to create virtual addresses for NIC 1312A of host 1302B. 

ifconfig hmeOrl 129.126.1.4 up 
ifconfig hme0:2 129.126.1.6 up 
ifconfig hme0:3 129.126.1.8 up 

Each of the three commands above defines and brings up a virtual address which 
corresponds to interface hmeO of host 1302B. When any of the above virtual addresses is 
specified, NIC 1312A recognizes the specified address as its own. 



Failover mechanism 1304A is configured to monitor NIC 1310A and 1310B. 
Monitoring may be done using a heartbeat mechanism, status register polling, or other 
similar mechanism. In the event failover mechanism 1304A detects a failure in 
communications through NIC 1310A, a failover to NIC 131 OB may be performed. To 
5 perform failover from NIC 1 3 10A to NIC 1 3 12 A, failover mechanism transfers the 
configuration parameters of NIC 131 OA to NIC 1310B. Included in the parameters 
transferred from NIC 131 OA to NIC 131 OB is the interface address 129.126.1.1. 
However, because there are a number of virtual addresses associated with NIC 131 OA, 
they must be transferred as well. Failover mechanism 1304 is configured to capture the 

10 current virtual IP address configuration of the interface 1331 OA at the time of failure and 
configure the backup interface 131 OB with the captured parameters. To capture the virtual 
IP address configuration of an interface, failover mechanism 1304 may utilize a 
command such as ifconfig or ipconfig. Advantageously, all network IP addresses are 
failed over and high availability is maintained. In one embodiment, failover mechanism 

15 1304 is configured to capture and store the virtual address configurations of the local 
interfaces upon startup. If upon failure of an interface connection, failover mechanism 
1304 is unable to capture the current virtual address configuration, the originally captured 
startup configuration is used for configuring the backup interface. Similarly, failover 
mechanism 1304B may be configured to perform a failover from NIC 1312A to NIC 

20 1312B. 

In addition to the embodiment described in fig. 13, virtual IP addresses may be 
used in conjunction with a myriad of configurations, including the embodiments 
described above. When virtual addresses are employed, the included failover mechanism 
25 may be configured to failover the virtual addresses as well. Further, while the above 
description refers to commands "ipconfig" and "ifconfig", the particular command and 
syntax used on a particular system may vary. In addition, references to a network "node" 
or network "device" may include computers, routers, bridges, switches or other similar 
devices. Finally, the above described embodiments are only intended to be exemplary. 

25 



Other embodiments, including variations and combinations of the above described 
features, are contemplated as well. 
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WHAT IS CLAIMED IS: 



1 . A method for maintaining high availability in a two node computer network, wherein 
said computer network includes alternate pathing, said method comprising: 

adding a first High Availability Networking (HAnet) mechanism to a first node of 
said two node computer network, wherein said HAnet mechanism operates 
within an Application layer of an architectural model of said computer 
network; 

monitoring a first network path of said computer network, wherein said first 

network path is between said first node and said second node, wherein said 
first network path corresponds to a primary path of said alternate pathing, 
and wherein said monitoring is by said first HAnet mechanism; 

detecting a failure of said first network path, wherein said failure is detected by 
said first HAnet mechanism; and 

performing a failover from said first network path to a second network path of 
said first node in response to detecting said failure, wherein said second 
network path corresponds to an alternate path of said alternate pathing, 
wherein said failover is performed by said first HAnet mechanism. 

2. The method of claim 1 , wherein said HAnet mechanism comprises computer 
software. 

3. The method of claim 1, wherein said first network path comprises a first network 
interface card in said first node coupled to a second network interface card in said 
second node, wherein said first network interface card and said second network 



interface card are coupled via a first crossover cable, and wherein said second 
network path comprises a third network interface card in said first node coupled to a 
fourth network interface card in said second node, wherein said third network 
interface card and said fourth network interface card are coupled via a second 
crossover cable. 

4. The method of claim 1, wherein said first node and said second node comprise 
servers. 

5. The method of claim 1, wherein said monitoring comprises said first node monitoring 
a heartbeat of said first network path. 

6. The method of claim 5, wherein said heartbeat comprises said first node conveying 
request packets to said second node and said second node conveying reply packets to 
said first node, where said request packets and said reply packets are conveyed via 
said first network path. 

7. The method of claim 1, wherein said monitoring comprises polling a status register 
corresponding to said first network path. 

8. The method of claim 7, wherein said status register is in a network interface card. 

9. The method of claim 6, wherein detecting said failure of said first network path 
comprises detecting said reply packets are not received from said second node. 

10. The method of claim 7, wherein said detecting said failure of said first network path 
comprises detecting said status register indicates said first network path has failed. 
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11. The method of claim 1, wherein performing said failover comprises disabling said 
first network path and enabling said second network path. 

12. The method of claim 3, wherein said failover comprises: 

pausing monitoring of said first network interface card; 
disabling said first network interface card; 
plumbing said third network interface card; 

configuring said third network interface card with interface address parameters 
corresponding to said first network interface card; and 

enabling said third network interface card; and 

resuming monitoring, wherein said second network path is monitored. 

13. The method of claim 12, wherein said interface address parameters include an IP 
address, broadcast address, netmask address and MAC address. 

14. The method of claim 13, wherein said interface address parameters further include a 
virtual IP address. 

15. The method of claim 12, wherein said enabling is subsequent to the expiration of a 
time delay, wherein said time delay corresponds to a delay parameter read from a 
configuration file. 



16. The method of claim 3, wherein said first network interface card, said second network 
interface card, said third network interface card, and said fourth network interface 
card are Ethernet adapters. 

17. The method of claim 3, wherein said first network interface card, said second network 
interface card, said third network interface card, and said fourth network interface 
card are Gigabit Ethernet adapters. 

18. The method of claim 1, further comprising: 

adding a second High Availability Networking (HAnet) mechanism to said 
second node of said two node computer network, wherein said second 
HAnet mechanism operates within an Application layer of an architectural 
model of said computer network; 

monitoring said first network path of said computer network, wherein said 
monitoring is by said second HAnet mechanism; 

detecting said failure of said first network path, wherein said failure is detected by 
said second HAnet mechanism; and 

performing a failover from said first network path to said second network path in 
response to detecting said failure, wherein said failover is performed by 
said second HAnet mechanism. 

19. A network node configured to support alternate pathing, said network node 
comprising: 
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a first network interface, wherein said first network interface is coupled to a first 
network path, wherein said first network path corresponds to a primary 
path of said alternate pathing; 

a second network interface, wherein said second network interface is coupled to 
an second network path, wherein said second path corresponds to an 
alternate path of said alternate pathing; 

a High Availability Networking (HAnet) mechanism, wherein said HAnet 

mechanism operates within an Application layer of an architectural model 
of said computer network, wherein said HAnet mechanism is configured 
to monitor said first network interface, and wherein said HAnet 
mechanism is configured to perform a failover from said first network 
interface to said second network interface in response to detecting a failure 
of said first network path. 

20. The network node of claim 19, wherein said HAnet mechanism is configured to 
monitor said first network interface by monitoring a heartbeat of a network 
connection corresponding to said first network interface. 

21 . The network node of claim 20, wherein said heartbeat comprises conveying request 
packets and receiving reply packets via said first network interface. 

22. The network node of claim 19, wherein said HAnet mechanism is configured to 
monitor said first network interface by polling a status register of said first network 
interface. 



23. The network node of claim 19, wherein said failover comprises disabling said first 
network interface and enabling said second network interface. 



24. The network node of claim 19, wherein said failover comprises: 



pausing monitoring of said first network interface; 

5 

disabling said first network interface; 

plumbing said second network interface; 

10 configuring said second network interface with interface address parameters 

corresponding to said first network interface; and 

enabling said second network interface; and 

15 resuming monitoring, wherein said second network interface is monitored. 

25. The network node of claim 24, wherein said interface address parameters include an 
IP address, broadcast address, netmask address and MAC address. 

20 26. The network node of claim 25, wherein said interface address parameters further 
include a virtual IP address. 

27. The network node of claim 19, wherein said first network interface and said second 
network interface comprise Ethernet adapters. 

25 

28. The network node of claim 19, wherein said first network interface and said second 
network interface comprise Gigabit Ethernet adapters. 
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29. A two node computer network configured to maintain high availability, wherein said 
computer network includes alternate pathing, said computer network comprising: 

a first node, wherein said first node includes a first High Availability Networking 
5 (HAnet) mechanism, wherein said first HAnet mechanism operates within 

an Application layer of an architectural model of said computer network; 

a second node; 

1 0 a second network path, wherein said second network path corresponds to an 

alternate path of said alternate pathing, wherein said second network path 
is between said first node and said second node; 

!:!,! a first network path, wherein said first network path corresponds to a primary path 

jffi 1 5 of said alternate pathing, wherein said first network path is between said 

I s ! : first node and said second node, and wherein said first HAnet mechanism 

J"' is configured to monitor said first network path, and wherein said first 

y HAnet mechanism is configured to perform a failover from said first 

Q network path to said second network path in response to detecting a failure 

i y 

□ 20 of said first network path. 

O 

30. The two node computer network of claim 29, wherein said first HAnet mechanism is 
configured to monitor said first network path by monitoring a heartbeat of said first 
network path. 

25 

31. The two node computer network of claim 30, wherein said heartbeat comprises 
conveying request packets and receiving reply packets via said first network path. 
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32. The two node computer network of claim 29, wherein said first HAnet mechanism is 
configured to monitor said first network path by polling a status register 
corresponding to a network interface of said first node. 

5 33. The two node computer network of claim 29, wherein said failover comprises 
disabling said first network path and enabling said second network path. 

34. The two node computer network of claim 29, wherein said failover comprises: 
1 0 pausing monitoring of said first network path; 

disabling said first network path; 
plumbing said second network path; 

configuring said second network path with interface address parameters 
corresponding to said first network path; and 

enabling said second network path; and 

resuming monitoring, wherein said second network path is monitored. 

35. The two node computer network of claim 34, wherein said interface address 
parameters include an IP address, broadcast address, and netmask address. 

25 

36. The two node computer network of claim 35, wherein said interface address 
parameters further include a virtual IP address. 
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37. The two node computer network of claim 29, wherein said first network interface and 
said second network interface comprise Ethernet adapters. 

38. The two node computer network of claim 29, wherein said first network interface and 
5 said second network interface comprise Gigabit Ethernet adapters. 

39. The two node computer network of claim 29, wherein said second node includes a 
second High Availability Networking (HAnet) mechanism, wherein said second 
HAnet mechanism operates within an Application layer of an architectural model of 

10 said computer network, wherein said second HAnet mechanism is configured to 

monitor said first network path, and wherein said HAnet mechanism is configured to 
detect said failure of said first network path, and wherein said HAnet mechanism is 
configured to perform a failover from said first network path to said second network 
path in response to detecting said failure. 
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ABSTRACT OF THE DISCLOSURE 

A method and mechanism operating within the Application layer of the 
5 architectural model for maintaining high availability in a two node computer network 
which utilizes alternate pathing. A failover mechanism operating within the Application 
layer monitors the original, primary network path. Upon detecting a failure of the primary 
path, the failover mechanism halts monitoring of the primary connection, configures the 
alternate network interface with the parameters of the primary network interface, and 
10 brings up the backup interface. 
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